Adaptive Systems
Review for NeurIPS paper: Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs
Summary and Contributions: Post-rebuttal: I would like to thank the authors for their response. As stated in the original review, I think comparing to DQN will improve the paper. This paper address the problem of robust control of continuous dynamic systems, where the system's dynamics is unknown but assumed to have a linear structure, with external polytopic disturbance. The proposed approach consists of several steps for each action, first model and confidence region estimation (or refinement), then worst case reward extraction and state estimation bounds, a conservative planning step based on the reward and state bounds, finally one step execution, and repeating the process in an MPC like manner. The paper presents an end to end approach to the robust control problem for unknown dynamics (only the system dynamic matrix is unknown) in an adaptive manner.
- Information Technology > Control Systems > Adaptive Systems (0.40)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.38)
Reviews: Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator
The central element of the paper is a (novel) algorithm that utilizes a convex optimization approach (the so-called System Level Synthesis approach, SLS) for synthesizing LQR controllers using estimated dynamics models. The SLS approach allows for an analysis of how the error in the matrix estimation affects the regret of the LQR controller. Using this controller synthesis, upper bounds on the estimation error of the dynamics matrices as well as upper and lower bounds for the expected loss are provided. The method is compared to existing approaches on a benchmark system. This computational study shows a comparable performance of all methods, with the presented method giving the nicest theoretical guarantees (e.g.
- Information Technology > Control Systems > Adaptive Systems (0.41)
- Information Technology > Artificial Intelligence > Machine Learning (0.40)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.38)
Adaptive Control based Friction Estimation for Tracking Control of Robot Manipulators
Huang, Junning, Tateo, Davide, Liu, Puze, Peters, Jan
Adaptive control is often used for friction compensation in trajectory tracking tasks because it does not require torque sensors. However, it has some drawbacks: first, the most common certainty-equivalence adaptive control design is based on linearized parameterization of the friction model, therefore nonlinear effects, including the stiction and Stribeck effect, are usually omitted. Second, the adaptive control-based estimation can be biased due to non-zero steady-state error. Third, neglecting unknown model mismatch could result in non-robust estimation. This paper proposes a novel linear parameterized friction model capturing the nonlinear static friction phenomenon. Subsequently, an adaptive control-based friction estimator is proposed to reduce the bias during estimation based on backstepping. Finally, we propose an algorithm to generate excitation for robust estimation. Using a KUKA iiwa 14, we conducted trajectory tracking experiments to evaluate the estimated friction model, including random Fourier and drawing trajectories, showing the effectiveness of our methodology in different control schemes.
- Information Technology > Control Systems > Adaptive Systems (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
Adaptive control of reaction-diffusion PDEs via neural operator-approximated gain kernels
Bhan, Luke, Shi, Yuanyuan, Krstic, Miroslav
Neural operator approximations of the gain kernels in PDE backstepping has emerged as a viable method for implementing controllers in real time. With such an approach, one approximates the gain kernel, which maps the plant coefficient into the solution of a PDE, with a neural operator. It is in adaptive control that the benefit of the neural operator is realized, as the kernel PDE solution needs to be computed online, for every updated estimate of the plant coefficient. We extend the neural operator methodology from adaptive control of a hyperbolic PDE to adaptive control of a benchmark parabolic PDE (a reaction-diffusion equation with a spatially-varying and unknown reaction coefficient). We prove global stability and asymptotic regulation of the plant state for a Lyapunov design of parameter adaptation. The key technical challenge of the result is handling the 2D nature of the gain kernels and proving that the target system with two distinct sources of perturbation terms, due to the parameter estimation error and due to the neural approximation error, is Lyapunov stable. To verify our theoretical result, we present simulations achieving calculation speedups up to 45x relative to the traditional finite difference solvers for every timestep in the simulation trajectory.
- Information Technology > Control Systems > Adaptive Systems (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Synchronisation-Oriented Design Approach for Adaptive Control
Cho, Namhoon, Lee, Seokwon, Shin, Hyo-Sang
This study presents a synchronisation-oriented perspective towards adaptive control which views model-referenced adaptation as synchronisation between actual and virtual dynamic systems. In the context of adaptation, model reference adaptive control methods make the state response of the actual plant follow a reference model. In the context of synchronisation, consensus methods involving diffusive coupling induce a collective behaviour across multiple agents. We draw from the understanding about the two time-scale nature of synchronisation motivated by the study of blended dynamics. The synchronisation-oriented approach consists in the design of a coupling input to achieve desired closed-loop error dynamics followed by the input allocation process to shape the collective behaviour. We suggest that synchronisation can be a reasonable design principle allowing a more holistic and systematic approach to the design of adaptive control systems for improved transient characteristics. Most notably, the proposed approach enables not only constructive derivation but also substantial generalisation of the previously developed closed-loop reference model adaptive control method. Practical significance of the proposed generalisation lies at the capability to improve the transient response characteristics and mitigate the unwanted peaking phenomenon at the same time.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Switzerland (0.04)
- Europe > France > Provence-Alpes-Côte d'Azur > Alpes-Maritimes > Nice (0.04)
- (7 more...)
- Information Technology > Control Systems > Adaptive Systems (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Robots (0.86)
Adaptive Control of Euler-Lagrange Systems under Time-varying State Constraints without a Priori Bounded Uncertainty
Sankaranarayanan, Viswa Narayanan, Satpute, Sumeet Gajanan, Roy, Spandan, Nikolakopoulos, George
In this article, a novel adaptive controller is designed for Euler-Lagrangian systems under predefined time-varying state constraints. The proposed controller could achieve this objective without a priori knowledge of system parameters and, crucially, of state-dependent uncertainties. The closed-loop stability is verified using the Lyapunov method, while the overall efficacy of the proposed scheme is verified using a simulated robotic arm compared to the state of the art.
- Information Technology > Artificial Intelligence > Robots (0.53)
- Information Technology > Control Systems > Adaptive Systems (0.40)
A Neural Net Model for Adaptive Control of Saccadic Accuracy by Primate Cerebellum and Brainstem
Accurate saccades require interaction between brainstem circuitry and the cerebeJJum. A model of this interaction is described, based on Kawato's principle of feedback-error-Iearning. In the model a part of the brainstem (the superior colliculus) acts as a simple feedback controJJer with no knowledge of initial eye position, and provides an error signal for the cerebeJJum to correct for eye-muscle nonIinearities. This teaches the cerebeJJum, modelled as a CMAC, to adjust appropriately the gain on the brainstem burst-generator's internal feedback loop and so alter the size of burst sent to the motoneurons. With direction-only errors the system rapidly learns to make accurate horizontal eye movements from any starting position, and adapts realistically to subsequent simulated eye-muscle weakening or displacement of the saccadic target.
- Information Technology > Control Systems > Adaptive Systems (0.40)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)
Fast, Robust Adaptive Control by Learning only Forward Models
A large class of motor control tasks requires that on each cycle the con(cid:173) troller is told its current state and must choose an action to achieve a specified, state-dependent, goal behaviour. This paper argues that the optimization of learning rate, the number of experimental control deci(cid:173) sions before adequate performance is obtained, and robustness is of prime importance-if necessary at the expense of computation per control cy(cid:173) cle and memory requirement. This is motivated by the observation that a robot which requires two thousand learning steps to achieve adequate performance, or a robot which occasionally gets stuck while learning, will always be undesirable, whereas moderate computational expense can be accommodated by increasingly powerful computer hardware. It is not un(cid:173) reasonable to assume the existence of inexpensive 100 Mflop controllers within a few years and so even processes with control cycles in the low tens of milliseconds will have millions of machine instructions in which to make their decisions. This paper outlines a learning control scheme which aims to make effective use of such computational power.
- Information Technology > Artificial Intelligence > Machine Learning (0.60)
- Information Technology > Control Systems > Adaptive Systems (0.40)
High Order Robust Adaptive Control Barrier Functions and Exponentially Stabilizing Adaptive Control Lyapunov Functions
In practice, however, many safetycritical stability and safety of nonlinear control systems has received constraints have relative degrees larger than one (e.g., significant attention in recent years. In particular, the unification constraints on the configuration of a mechanical system of Control Lyapunov Functions (CLFs) [1], [2] and generally have at least relative degree two). The unification Control Barrier Functions (CBFs) [3], [4] has provided a of CLFs/CBFs with techniques from CL adaptive control was pathway towards safe and stable control of complex nonlinear also presented in [23], [24]; however, the resulting CLF controllers systems such as autonomous vehicles [5], [6], multiagent either only guarantee uniformly ultimately bounded systems [7], and bipedal robots [8]. Although powerful, stability or are limited to single-input feedback linearizable the guarantees afforded by these approaches are modelbased, systems. Importantly, the CL-based aCBF controllers from hence the success in transferring such guarantees [23], [24] do not provide strong safety guarantees since the to real-world systems is inherently tied to the fidelity of CBF-based control inputs are generated using the estimated the underlying system model. Inevitably, such models are dynamics without accounting for estimation errors, leading only an approximation of the true system due to parametric to potential safety violations that can be understood through uncertainties and unmodeled dynamics, thus there is strong the notion of input-to-state-safety [25].
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Control Systems > Adaptive Systems (0.99)
Identification and Adaptive Control of Markov Jump Systems: Sample Complexity and Regret Bounds
Sattar, Yahya, Du, Zhe, Tarzanagh, Davoud Ataee, Balzano, Laura, Ozay, Necmiye, Oymak, Samet
Learning how to effectively control unknown dynamical systems is crucial for intelligent autonomous systems. This task becomes a significant challenge when the underlying dynamics are changing with time. Motivated by this challenge, this paper considers the problem of controlling an unknown Markov jump linear system (MJS) to optimize a quadratic objective. By taking a model-based perspective, we consider identification-based adaptive control for MJSs. We first provide a system identification algorithm for MJS to learn the dynamics in each mode as well as the Markov transition matrix, underlying the evolution of the mode switches, from a single trajectory of the system states, inputs, and modes. Through mixing-time arguments, sample complexity of this algorithm is shown to be $\mathcal{O}(1/\sqrt{T})$. We then propose an adaptive control scheme that performs system identification together with certainty equivalent control to adapt the controllers in an episodic fashion. Combining our sample complexity results with recent perturbation results for certainty equivalent control, we prove that when the episode lengths are appropriately chosen, the proposed adaptive control scheme achieves $\mathcal{O}(\sqrt{T})$ regret, which can be improved to $\mathcal{O}(polylog(T))$ with partial knowledge of the system. Our proof strategy introduces innovations to handle Markovian jumps and a weaker notion of stability common in MJSs. Our analysis provides insights into system theoretic quantities that affect learning accuracy and control performance. Numerical simulations are presented to further reinforce these insights.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East > Jordan (0.04)
- South America > Brazil (0.04)
- (3 more...)
- Energy (0.93)
- Banking & Finance > Economy (0.45)
- Information Technology > Control Systems > Adaptive Systems (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)